Acknowledge reception of data in `TrinoResult` #220

mdesmet · 2022-08-12T23:44:26Z

Description

Ensures the received data is properly acknowledged by calling the next_uri. This will avoid seeing failed queries in the query log when executing scalar queries as in the following example.

cur.execute("SELECT VERSION()")
cur.fetchone()
cur.cancel()

Release notes

( ) This is not user-visible and no release notes are required.
( ) Release notes are required, please propose a release note for me.
(x) Release notes are required, with the following suggested text:

## Breaking Changes

* Make the `execute` method of the cursor block until at-least one row is received.
  This means users no longer need to call `fetchone` or `fetchall` to make sure query
  actually starts executing on the Trino server. Note that results still need to be consumed
  by calling `fetchone` or `fetchall` to make sure query isn't considered idle and terminated
  on the server. ([#232](https://trinodb/trino-python-client/issues/232))

* Properly propagate query failures to the client when using `fetchone`.
  ([#95](https://trinodb/trino-python-client/issues/95))
* Fix queries returning a single row from sometimes appearing as failed on the server.
  ([#220](https://trinodb/trino-python-client/issues/220))

trino/client.py

mdesmet · 2022-08-14T10:33:50Z

The CI issues can be simulated by following code. Seems that although the result has returned the Trino API still provides one or more next_uri's to fetch in a minority of cases.

def test_query_cancellation_not_triggered(trino_connection):
    count_not_finished = 0
    for _ in range(0, 1000):
        cur = trino_connection.cursor()
        cur.execute("SELECT VERSION()")
        cur.fetchone()
        if not cur._query.finished:
            count_not_finished += 1

    print(str(count_not_finished) + " unfinished queries")

Some questions:

can we handle this on the serverside to not let this type of queries fail if all results have been consumed?
Why do we mark a cancelled query as failed if it's a valid use case to only retrieve a number of results and bail out?

hashhar · 2022-08-16T11:42:48Z

Why do we mark a cancelled query as failed if it's a valid use case to only retrieve a number of results and bail out?

It's not valid. It's a hack that BI tools and clients use instead of limiting their query to pull only what is needed.
e.g. As a cluster admin it's very useful to see clients who run a query that returns billions of rows but just take 100 rows and leave the query hanging (instead of either cancelling or consuming results) which means that the output buffer on the server (and other processes) keeps occupying memory until the query times out.

can we handle this on the serverside to not let this type of queries fail if all results have been consumed?

You know that version() would return one row but the server does not since results are streamed back to coordinator from workers and coordinator can't know that there isn't more data coming until it asks workers about it.

Change your experiment to a query which returns arbitrary number of rows and then you can't know anymore whether query is finished or not. Special casing the client protocol for queries which return single row doesn't seem useful.

hashhar · 2022-08-16T11:43:33Z

The alternative is to have a client protocol which is based on persistent TCP connections instead of HTTP long polling - which brings it's own set of problems.

hashhar · 2022-08-16T11:47:52Z

Also, specifically on why you cannot assume an empty rows being returned from API as proof that query has finished is the pipelined execution model. In queries it's possible for Trino to perform both table scans and output results at the same time. e.g. If you have a long chain of UNION ALL statements then Trino can start returning results as soon as the first UNION ALL query is done while other parts of the query are still executing .

This means that the client might observe periods of time where there is no data returned but a nextUri is still included. If the client were to assume no data == query finished then it'll drop any upcoming rows that will be produced.

nineinchnick · 2022-08-16T11:51:08Z

@hashhar would you agree that to address the original issue we should add a fetch call after scalar() to drain the cursor? We might also want to document this somewhere. I don't think we should try to make the driver too smart to work around these protocol limitations.

trino/client.py

hashhar · 2022-08-16T12:01:41Z

@nineinchnick I don't agree 100%. It's still useful to make the client as smart as the JDBC driver where the implementation detail of the Trino REST protocol isn't visible to users.

But yes, since this might take more time than the quickfix I think we should go with the quickfix for now and then think about how to stop the protocol from leaking into user code.

mdesmet · 2022-08-16T22:42:51Z

@hashhar would you agree that to address the original issue we should add a fetch call after scalar() to drain the cursor? We might also want to document this somewhere. I don't think we should try to make the driver too smart to work around these protocol limitations.

This is exactly what sqlalchemy does when you scalar_one or scalar_one_or_none. Again we already fetch the next row in client module's fetch now, but even with fetching another record we have no guarantee that would finish the query (nextUri set to null) as @hashhar said.

https://github.com/sqlalchemy/sqlalchemy/blob/f8c4dba4e9f130c18ce00597c036bc26ae7abf90/lib/sqlalchemy/engine/result.py#L745-L748

trino/client.py

mdesmet · 2022-09-17T15:34:26Z

However this is also a breaking change (even though it improves experience) so I'm approving it but not merging until we release a version from current master so that people can upgrade to that version and then choose to decide whether they want to stay there for sometime before migrating to new blocking API.

Can you explain why you think this is a breaking change? IMHO the only way to break existing usage is if users didn't catch exceptions on execute(), which is a bug already as query submission can throw the same exceptions as fetch*(). The fetch*() operations continue to work as before, as proven by the integration tests.

Note that sqlalchemy doesn't call fetch*() on DML operations. Bringing this in would fix that.

And now that we are taking this direction it might be wise to decouple the Trino API handling from the db-api client and instead make available the fetched rows to the db-api cursor via a queue (list) instead of it directly fetching things from API. That gives future flexibility to introduce performance enhancements as well like the double-bufferring that the JDBC driver does for example and also make it possible to provide different cursor implementations.

IMHO this is already decoupled. The TrinoQuery exists in client module and the exposes a lazy collection (an Iterator powered by a generator). I think this is the correct abstraction to use. I don't see leakage of the API details being introduced in this PR, actually the opposite is true: I would argue that in current code execute() is only query submission while fetch*() is query execution and result set scrolling, while this PR makes execute() query submission and execution and fetch*() result set scrolling which seems semantically more correct and in line with other dbapi implementations.

The double buffering as in the java Trino client, is implemented in this PR. Note that also the Java client doesn't use threading at this moment. I think it is a good idea to investigate but can be done independent from this PR.

I don't see why cursor implementations are impacted, cursors would take the Iterable and convert it for example in a dict instead of a tuple (many dbapi implementations have a DictCursor).

hashhar · 2022-09-20T20:14:36Z

Can you explain why you think this is a breaking change?

Because the API is now blocking.

I don't see leakage of the API details being introduced in this PR, actually the opposite is true:

I don't mean that this PR leaks the API details. I meant the opposite. Now we are one step closer to hide the API details within TrinoQuery and TrinoResult. An example of what this PR allows to do (but probably doesn't make sense) is to have a different impl of TrinoQuery which can probably use a different fictional transport mechanism to talk to Trino (instead of the REST API).

The double buffering as in the java Trino client, is implemented in this PR.

True

Note that also the Java client doesn't use threading at this moment. I think it is a good idea to investigate but can be done independent from this PR.

Exactly what I said above.

I don't see why cursor implementations are impacted, cursors would take the Iterable and convert it for example in a dict instead of a tuple (many dbapi implementations have a DictCursor).

Again exactly what I said your PR allows us to do in future.

trino/sqlalchemy/dialect.py

hashhar · 2022-09-20T20:22:18Z

Newer Trino versions will include trinodb/trino#14122 which can mean CI can be green without this change.

I think we should add one more entry to matrix with 395 as the version being tested for the meantime.

mdesmet · 2022-09-21T13:40:08Z

Newer Trino versions will include trinodb/trino#14122 which can mean CI can be green without this change.

I think we should add one more entry to matrix with 395 as the version being tested for the meantime.

I added the entry.

hashhar

Acknowledge reception of data in TrinoResult fails without Execute should block until at least one row is received

So it seems we need to squash those.

LGTM otherwise.

A query only transitions to a FINISHED state when the results are fully consumed. The reception of the data is acknowledged by calling the next_uri before exposing the data through dbapi. `dbapi.execute()` will now block until the first result is received. This will ensure the result set is exhausted for DML queries and as such remove the need to for calling `dbapi.fetchall()`.

mdesmet · 2022-09-22T16:44:48Z

@hashhar: Squashed those commits. Are we good to go?

hashhar · 2022-09-22T16:51:51Z

Thanks. Good to go.

cla-bot bot added the cla-signed label Aug 12, 2022

hashhar mentioned this pull request Aug 13, 2022

Fix issue of queries not reaching completed state #210

Closed

nineinchnick approved these changes Aug 13, 2022

View reviewed changes

trino/client.py Outdated Show resolved Hide resolved

mdesmet requested review from hashhar and ebyhr August 15, 2022 14:15

mdesmet force-pushed the bug/consume-results branch 2 times, most recently from e35d986 to 5d59610 Compare August 16, 2022 11:19

hashhar reviewed Aug 16, 2022

View reviewed changes

trino/client.py Show resolved Hide resolved

mdesmet force-pushed the bug/consume-results branch from 5d59610 to 4447b2e Compare August 16, 2022 22:50

mdesmet requested a review from hashhar August 18, 2022 10:56

mdesmet force-pushed the bug/consume-results branch from 4447b2e to 9d898a8 Compare September 1, 2022 11:40

mdesmet mentioned this pull request Sep 1, 2022

Add isort (for sorting imports) #227

Merged

mdesmet force-pushed the bug/consume-results branch from d206e80 to d358954 Compare September 13, 2022 14:13

mdesmet marked this pull request as draft September 13, 2022 14:18

mdesmet force-pushed the bug/consume-results branch 4 times, most recently from 2bec10e to fb66f9c Compare September 13, 2022 22:38

mdesmet marked this pull request as ready for review September 13, 2022 22:39

mdesmet requested review from hovaesco and lpoulain September 14, 2022 07:46

hovaesco reviewed Sep 14, 2022

View reviewed changes

trino/client.py Outdated Show resolved Hide resolved

mdesmet mentioned this pull request Sep 14, 2022

execute does not kick off query on Trino Server #232

Closed

1 task

mdesmet force-pushed the bug/consume-results branch from ebdc096 to a03c5a9 Compare September 16, 2022 14:47

mdesmet mentioned this pull request Sep 19, 2022

Add JSON functionality to SQLAlchemy Dialect #214

Merged

hashhar approved these changes Sep 20, 2022

View reviewed changes

trino/sqlalchemy/dialect.py Show resolved Hide resolved

Use scalar_one* to exhaust result set in sqlalchemy

15cf185

mdesmet force-pushed the bug/consume-results branch from a03c5a9 to 29fb5fa Compare September 21, 2022 13:34

hashhar approved these changes Sep 21, 2022

View reviewed changes

mdesmet added 3 commits September 21, 2022 21:59

Fix mutable constructor argument

7b3145f

Add Trino 395 to ci matrix

4858d6b

mdesmet force-pushed the bug/consume-results branch from 96e7c68 to 4858d6b Compare September 21, 2022 20:15

hashhar merged commit 5bdc073 into trinodb:master Sep 22, 2022

This was referenced Sep 22, 2022

Trino-python-client not returning/capturing External Errors when using fetchone #95

Closed

Add 0.317.0 release notes #238

Merged

mdesmet deleted the bug/consume-results branch September 22, 2022 19:39

MichaelTiemannOSC mentioned this pull request Oct 15, 2022

Trino client issue: "This result object does not return rows. It has been closed automatically" os-climate/os_c_data_commons#214

Closed

This was referenced Oct 20, 2022

Cannot create table prestodb/presto-python-client#84

Open

Trino and Presto hooks do not properly execute statements other than SELECT apache/airflow#26774

Closed

Bump Trino version to fix non-working DML queries apache/airflow#27168

Merged

hashhar mentioned this pull request Nov 21, 2022

SQLAlchemy 2.0 Compatibility #291

Closed

1 task

hashhar mentioned this pull request Jul 21, 2023

How to get infoUri before query finishes #388

Open

1 task

dungdm93 mentioned this pull request Aug 3, 2023

Trino queries cannot be stopped in SQL Lab apache/superset#24858

Closed

3 tasks

giftig mentioned this pull request Aug 3, 2023

fix(sqllab): Force trino client async execution apache/superset#24859

Merged

9 tasks

dungdm93 mentioned this pull request Aug 8, 2023

add an option to deferred fetch result in Cursor.execute() #400

Open

villebro mentioned this pull request Oct 5, 2023

fix: revert fix(sqllab): Force trino client async execution (#24859) apache/superset#25541

Merged

9 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Acknowledge reception of data in `TrinoResult` #220

Acknowledge reception of data in `TrinoResult` #220

mdesmet commented Aug 12, 2022 •

edited by hashhar

Loading

mdesmet commented Aug 14, 2022 •

edited

Loading

hashhar commented Aug 16, 2022

hashhar commented Aug 16, 2022

hashhar commented Aug 16, 2022

nineinchnick commented Aug 16, 2022

hashhar commented Aug 16, 2022

mdesmet commented Aug 16, 2022

mdesmet commented Sep 17, 2022

hashhar commented Sep 20, 2022

hashhar commented Sep 20, 2022

mdesmet commented Sep 21, 2022

hashhar left a comment

mdesmet commented Sep 22, 2022

hashhar commented Sep 22, 2022

Acknowledge reception of data in TrinoResult #220

Acknowledge reception of data in TrinoResult #220

Conversation

mdesmet commented Aug 12, 2022 • edited by hashhar Loading

Description

Release notes

mdesmet commented Aug 14, 2022 • edited Loading

hashhar commented Aug 16, 2022

hashhar commented Aug 16, 2022

hashhar commented Aug 16, 2022

nineinchnick commented Aug 16, 2022

hashhar commented Aug 16, 2022

mdesmet commented Aug 16, 2022

mdesmet commented Sep 17, 2022

hashhar commented Sep 20, 2022

hashhar commented Sep 20, 2022

mdesmet commented Sep 21, 2022

hashhar left a comment

Choose a reason for hiding this comment

mdesmet commented Sep 22, 2022

hashhar commented Sep 22, 2022

Acknowledge reception of data in `TrinoResult` #220

Acknowledge reception of data in `TrinoResult` #220

mdesmet commented Aug 12, 2022 •

edited by hashhar

Loading

mdesmet commented Aug 14, 2022 •

edited

Loading